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Method And Apparatus For PMD/PDL/PDG Mitigation 
Cross-Reference to Related Application 

This application is related to U. S. patent applications entitled "System And 
Method For Multi-Channel Mitigation of PMD/PDL/PDG" filed on July 31, 2003 and 
5 August 3 1 , 2003, which are incorporated herein by reference. 
Field of the Invention 

The present invention relates to optical communications, and more specifically to 
polarization scrambling methods and apparatus for mitigation of polarization-mode- 
dispersion (PMD), polarization-dependent loss (PDL), and polarization-dependent gain 
10 (PDG) penalties in optical communication systems. 
Background of the Invention 

PMD, PDL, and PDG are significant penalty sources in high-speed (e.g., 10 Gb/s 
and 40 Gb/s) transmissions. PMD compensation (PMDC) is normally desirable to 
increase system tolerance to PMD. However, due to the stochastic nature of PMD and its 
15 wavelength dependence, PMDC is normally implemented for each wavelength channel 
individually, and is thus, generally not cost-effective. Various prior art methods have 
been proposed to achieve PMDC simultaneously for multiple WDM channels. Channel 
switching is one technique that has been proposed to mitigate the overall PMD penalty in 
a WDM system. However, such systems sacrifice system capacity due to the use of extra 
20 channels for PMD protection. Multi-channel PMDC before wavelength de-multiplexing 
has also been proposed to compensate for the PMD degradation in the WDM channel 
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having the most severe PMD. However, such a mitigation scheme may cause 
degradation of other channels. 

Another scheme for a multi-channel shared PMDC has been proposed in which 
the most degraded channel is switched, by optical or electrical means, to a path connected 
5 to the shared PMDC. However, the speed of such a PMDC solution is limited by the 
speed of the optical or electrical switching. 

In each of the prior art PMDC schemes discussed above, undesirable PMD 
induced system outages (during which the PMD penalty exceeds its pre-allocated system 
margin) are present, though reduced. 
10 Forward-error-correction (FEC) is an effective technique for increasing system 

margin cost-effectively. It has been determined, however, that FEC cannot extend the 
tolerable PMD for a fixed PMD penalty at a given average bit-error-rate (BER), even 
though the additional margin provided by FEC can be used to increase the PMD 
tolerance. It has been suggested that sufficient interleaving in FEC may increase PMD 
15 tolerance. However, there is no known practical method to provide the deep interleaving 
needed to avoid a PMD outage, which may last minutes or longer in practical systems. 

Although distributed polarization scrambling has been proposed as an effective 
technique for mitigation of PMD/PDL/PDG penalties, a need exists for optimized and 
cost effective designs for providing polarization scrambling to mitigate PMD/PDL/PDG. 
20 Summary of the Invention 

The present invention provides a polarization scrambling method and apparatus 
for multi-channel PMD/PDL/PDG mitigation. 
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According to one embodiment, the present invention a polarization scrambler 
apparatus is provided for use in nodes of an optical communication system. The 
apparatus comprises M polarization controllers and drive circuitry adapted to drive the M 
polarization controllers at at least one of a plurality of frequencies fj f M , wherein 
5 fi>f 2 ...>f M , and f x > about BR / (BECL x N). BR is the highest bit rate of the optical 
signal, BECL is a maximum burst error correction length of forward error correction used 
in the optical communication system and N is the number of nodes having one or more 
polarization controllers. 

According to another embodiment, an optical communications method is provided 
10 for use in at least one of N nodes of an optical communication system. The method 
comprises driving M polarization controllers at at least one of a plurality of frequencies f, 
- f M such that f,>f 2 . . >f M; and f, > about BR / (BECL x N). BR is the highest bit rate of 
the optical signal being transmitted, and BECL is a maximum burst error correction 
length of forward error correction used in the optical communication system. 
15 Brief Description of the Drawing 

The foregoing and other objects, features and advantages of the present invention 
will become more apparent from the following detailed description when taken in 
conjunction with the appended drawings. 

It is to be noted, however, that the appended drawings illustrate only exemplary 
20 embodiments of the invention and are therefore not to be considered limiting of the scope 
of the invention. 
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FIGS. 1 A-D are plots illustrating a working principle of embodiments of the 
present invention; 

FIG. 2 is a diagram depicting one embodiment of a system according to the 
invention; 

5 FIG. 3 is a diagram depicting another embodiment of a system according to the 

invention; 

FIG. 4 is a diagram of an embodiment of a polarization scrambler module 
according to the invention; 

FIG. 5 is a diagram showing an embodiment of drive circuitry that can be used in 
1 0 accordance with the invention; 

FIGS. 6 A-D are plots showing the Maxwellian distribution of a link differential- 
group-delay (DGD); the link DGD distributions during an outage event with one 
polarization scrambler module (PSM) in the middle of the link; and the DGD 
distributions of the first and second half of the link during the outage, respectively; 
15 FIGS. 7 A-B are plots showing the distribution of the link DGD distributions 

during an outage with 2 and 6 PSMs, respectively; 

FIG. 8 is a plot showing the outage probability (OP) vs. the number of PSMs 
assuming idealized PMD scrambling (dotted line) and with insufficient scrambling speed 
(dashed line); 

20 FIGs. 9 A-B are plots showing relative required OSNR to achieve BER=10" 15 as a 

function of PMD in NRZ-OOK (left) and NRZ-DPSK (right) systems without FEC 
(circles), with FEC and no PSMs (squares), and with FEC and PSMs (diamonds); and 
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FIG. 10 is a plot showing the dependence of corrected BER (by FEC) on 
uncorrected BER. 

Detailed Description of the Invention 

In one aspect of the present invention methods and apparatus are employed to 

5 provide polarization scrambling (for use in conjunction with FEC) to change the 
polarization of a signal between at least two states during each burst-error-correcting 
period (BECP) of the FEC. By changing polarization at least once during each BECP the 
PMD experienced by a signal is varied during each BECP, and the PMD induced 
"outages" are effectively limited to last for a period that is shorter than the correcting 

10 period simultaneously for all wavelength channels. The FEC can then effectively correct 
the dominating errors that occur and thereby improve system tolerance to PMD and 
prevent system outages. (BECP is in units of time, which equals burst error correction 
length (BECL) multiplied by the bit period. For the ITU standard G.709, the BECL = 
1024 bits. Thus, in a G.709 standardized 10.7-Gb/s system, the BECP is approximately 

15 1024 x lOOps-O.l^is.) 

FIGS. 1 A-D illustrate a working principle of present invention. FIGS. 1 A-B 
show the case without polarization scrambler modules (PSMs). As shown in FIGS. 1 A- 
B, PMD occasionally causes severe signal waveform distortion, which results in 
consecutive or very frequent errors. Such PMD-induced distortion typically last from 

20 milliseconds up to minutes. 

For any given FEC code, there is a maximum number of correctable errors per 
FEC frame (or block), N maxframe . There is also a maximum number of correctable 



Liu 28 6 

consecutive burst errors per FEC frame, N max burst , (which is referred to herein as the 
BECL, and is generally less or equal to N max framc ). FEC is unable to correct the errors 
(and may even generate more errors) when the errors occur so frequently that during each 
FEC frame period (normally on the order of microseconds) the number of errors exceeds 

5 N max frame , or occur consecutively for more than N max burst times. These events, during 
which a system fails (even with an allocated margin) because of PMD, are called PMD- 
induced outage events, and are illustrated in FIG. IB. 

Using PSMs, in accordance with aspects of the present invention, to scramble the 
PMD (experienced by a signal during transmission) during each FEC frame, redistributes 

10 the PMD to close to its original Maxwellian distribution, such that no consecutive errors 
(due to PMD) last longer than N max burst , as shown in FIG. 1C. By doing so, the errors are 
substantially uniformly distributed when looking at a time resolution of an FEC frame 
period, and can thus be effectively corrected by FEC, providing an appropriate system 
margin is allocated for PMD. It can be understood that the total number of errors (before 

15 FEC correction) over an infinite time period will be the same for the two cases without 
and with PSMs. The redistribution of the PMD effectively enables FEC to correct errors 
during what would otherwise be a PMD outage event, so the PMD induced OP is 
substantially reduced, as shown in FIG. IB. 

One embodiment of a system 20 in accordance with the invention is shown in 

20 FIG. 2. In operation, a high-speed signal (e.g., OC192) is first FEC encoded by an FEC 
encoder 201, and then used to modulate light from a light source 202, forming one 
wavelength channel 203 of an optical signal. A plurality of such channels are similarly 
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encoded and are multiplexed in a wavelength-division-multiplexer (WDM) 204 to form 
an optical signal for transmission across a link. The link comprises one or more 
transmission spans 205. The transmission spans 205 preferably comprise one or more 
transmission fiber spans 206, one or more optical amplifiers 207 (e.g., EDFAs), and, if 
5 necessary, dispersion compensating modules (DCMs, not shown). 

In the embodiment shown in FIG. 2, a polarization scrambler module (PSM) 208 
is positioned within the span 205. It can be understood by those skilled in the art that one 
or more PSMs 208 can be distributed along a link, (e.g., one or more PSMs can be added 
in one or more amplified spans). 

10 Another embodiment of a system employing FEC encoding (using 2 PSMs per 

node in a system with N nodes) in accordance with the invention is shown in FIG. 3. As 
used herein, the term 'node" refers to any point in a system or network having one or 
more PSMs. The two PSMs of each node are preferably driven at two different 
frequencies (as discussed further below). 

15 One embodiment of a PSM 400 in accordance with the present invention is shown 

in FIG. 4. As shown in FIG. 4, the PSM 400 may comprise a rotatable half-wave plate 
device 410 and drive circuitry 420. As can be understood by those skilled in the art the 
rotatable half- wave plate device 410 effectively serves as a polarization varying device or 
polarization controller. Alternatively, other devices may be used to vary or control the 

20 polarization of transmitted optical signals, including phase modulators, rotatable quarter- 
wave plates, and the like. As used herein, 'polarization controller' refers to any type of 
polarization varying device, including but not limited to those discussed above. 
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FIG. 5 is a diagram showing one embodiment of drive circuitry 500 which can be 
used in accordance with embodiments of the invention. As can be understood from FIG. 
5 5 a reference clock signal 501 is input into a processor 502 to generate substantially 
sinusoidal RF drive signals 504, 505. The processor 502 can be an integrated circuit, or 
5 any other device capable of generating one or more drive signals 504, 505. 

The drive signals 504, 505 preferably have the same frequency and may have a 
variable delay with respect to each other. The drive signals 504, 505 can be used to drive 
polarization controllers in accordance with the invention. 

Optical amplifiers 506, 507 may be used to amplify the drive signals 504, 505 
10 before driving a polarization controller 510 (shown in phantom). It is understood that the 
drive circuitry 500 may be integrated with the polarization controller 510, or may be a 
separate device/component. 

Additionally, those skilled in the art can appreciate that the drive circuitry 500 can 
be arranged to generate a single sinusoidal drive signal to drive a phase modulator based 
15 polarization controller device in accordance with alternate embodiments of the invention 
discussed herein. 

Referring again to FIG. 2, the PSMs 208 are preferably positioned along a link 
where the signal power is relatively high (e.g., after an optical amplifier) so that the 
OSNR degradation due to the loss from the PSMs is substantially minimized. It is also 
20 preferred that the PSMs 208 are substantially uniformly distributed along the link (e.g., 
spaced along the link based on PMD values of the spans within the link) so that the PMD 
is more effectively redistributed. 
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The PSM 208 can comprise, for example, a single-stage LiNb0 3 based phase 
modulator, or any other device that provides sufficient polarization scrambling. 
Preferably, at least two PSMs are driven at different frequencies (that are "relatively 
prime" to each other), and are used together in a given node to randomize signal 
5 polarization, independent of the polarization state of the input signal. "Relatively prime" 
as used herein refers to being relatively prime (i.e. they have no factors in common except 
for 1) with respect to the first two significant digits (after rounding) of the higher 
frequency. For example, 10.9 MHz and 8.2 MHz are considered herein to be relatively 
prime after rounding since 10.9 becomes 11 and 8.2 becomes 8, and 11 and 8 are 

10 relatively prime. As an additional example 15.2 and 30.1 are not considered relatively 
prime since after rounding 15 and 30 are not relatively prime. 

At the receiver side of the system 20, WDM channels of the transmitted optical 
signal are de-multiplexed by demultiplexer 210 and then individually detected at a 
receiver 220. The signals are then FEC decoded with an FEC decoder 230 to obtain the 

15 original data signal. 

The instant PMD of a link can be represented by a vector, Q, whose length equals 
the DGD between two principle states of polarization (PSPs) of the fiber link, and whose 
direction is aligned with the maximum delay PSP. Generally, the distribution of DGD 
follows Maxwellian distribution, as shown in the plot of FIG. 6A. At some rare 

20 occasions (toward the tail of the Maxwellian distribution), instant |Q| can be much larger 
than the average link DGD, Q (or DGD), resulting in a large penalty. Outage probability 
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(OP) is commonly used to assess the probability of having a PMD penalty larger than a 
pre-allocated amount (e.g., 2 dB in required OSNR). It is desirable to have the OP as 
small as possible. 

Numerical simulations have shown that OP can be reduced using distributed 
5 PSMs in accordance with embodiments of the invention. As illustrated in FIG. 6B 5 given 
an outage event during which the instant |Q 0 | = 3fi, |Q| is redistributed by inserting a 
PSM at the middle of the link. The new distribution is obtained as follows. We first find 
all the possible pairs of PMD vectors of the first and second half of the link, Q, and Q 2 , 
that satisfy Q,+Q 2 =Q, and their occurrence probabilities. The distributions of |Q,| and 

10 |Q 2 | are shown in FIGS. 6C-D. For each (Q, Q 2 ) pair, we rotate Q, on the Poincare 
sphere with all possible states uniformly sampled (to emulate the function of the PSM), 
and sum it with Q 2 to obtain a new PMD vector, Q new . The distribution of |Q new | is then 
obtained by calculating the relative probabilities of all the sampled DGD values and re- 
normalizing them. Clearly, the new distribution is no longer isolated around 3 o , but has 

15 a substantial portion around n . 

The above process was repeated for cases with 2 or more distributed PSMs 208. 
FIGS. 7 A-B show the new DGD distribution with 2 and 6 uniformly distributed PSMs, 
respectively. With an increased number of PSMs 208, the DGD distribution becomes 
closer to the original Maxwellian distribution. It can be appreciated by those skilled in 

20 the art that the DGD distribution of the i-th section, |Q S |, is likely to be distributed around 
|Q|/(N+1) (N being the total number of PSMs) providing that |Q|/(N+1)>q/(N+1) 1/2 , 
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since the Maxwellian distribution strongly favors that |Q S | be close to q/(N+1) ,/2 . With 
distributed polarization scrambling, the new Q can be seen as the quadratic summation of 
all the sectional PMD vectors, and its mean value can be approximated as 

Q,^*max(Q, |Q 0 |/V^TT). (1) 

5 As N becomes sufficiently large, the new mean PMD approaches Q . This 

qualitatively explains the convergence of the new link DGD distribution from an outage 
event to its original Maxwellian distribution through the use of distributed PSMs. 
PSM Speed Requirement For Outage Prevention 

To effectively redistribute the link DGD during an outage event to the original 

10 Maxwellian distribution, the speed requirement of PSMs 208 (which is closely related to 
the FEC code used and the system data rate) is an important parameter. Generally, an 
FEC code is capable of correcting N max frame maximum number of errors per FEC frame, 
and N max _ bU rst maximum number of consecutive burst errors. RS-FEC has an advantageous 
feature that N max burst equals N max _ frame . In one version of ITU's recommended FEC (G.709 

15 standard), RS (255,239) code with an interleaving depth of 16 is used, resulting in 
N max burst = N max_frame = 8x16 bytes (or 1024 bits). The corresponding burst-error-correcting- 
period (BECP) of is about 0.1 (is for 10-Gb/s systems (0.025 jus for 40-Gb/s systems). To 
change the state of polarization at least once during each BECP, the speed of the PSM 
needs to be greater than about 10 MHz, and greater than about 40 MHz, for 10-Gb/s and 

20 40-Gb/s systems, respectively. In systems where multiple data rates coexist (e.g., a 
system with 10 Gb/s channels mixed with 40-Gb/s channels), a minimal speed 
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requirement is preferably determined by the highest data rate. The speed requirement can 

be relaxed when PSMs are distributed in many nodes along the transmission link. The 

larger the number of the nodes (N), the lower the required minimum speed of each PSM. 

LiNb0 3 -based PSMs are capable of polarization scrambling with speeds of up to a few 
5 GHz, and may be used in accordance with the invention. Using advanced FEC codes 

with large burst-error-correction capability, the speed requirements of the PSMs 208 

speed may be relaxed. 

The performance improvement through the use of distributed PSMs was accessed 

and is discussed below. The PMD-induced OP assuming idealized or sufficient PMD 
10 scrambling which redistributed the link DGD to the original Maxwellian distribution was 

considered. It was understood that there is a small probability that PMD outages may 

occur even after the PMD scrambling through N nodes with PSM where the new link 

DGD is still large enough to cause system outage (or it is still larger than the specific |Q|). 

We can write the new OP (after sufficient PMD scrambling, OP sufficient ) as 
1 5 OP suFtcicnt (AT) = M{Q 4- [M~ x (OP 0 )-Q]- VivTT} , (2) 

where M(x) is the probability of obtaining DGD that is larger than x assuming the DGD 

is Maxwellian distributed with mean of q , or 

M _1 (y) is the inverse function of M(x). FIG. 8 shows the dependence of the new OP on N 
20 for assuming the original OP to be 10" 3 (dotted line for sufficient PMD scrambling). The 
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new OP is substantially reduced with the increase of N. More than ten orders of 
magnitude of reduction in OP may be achieved with about 10 PSMs. 

The performance of the outage prevention under insufficient polarization 
scrambling speed is of practical interest. The impact of insufficient scrambling speed is 
5 the reduction of the effective number of PSMs. Assuming the change of polarization 
caused by PSMs are sinusoidal and additive, we can extend Eq. (3) to take into 
consideration the impact as 

insufficient 

W^OP^ cieM (N-p) (4) 
where p is the ratio between the actual PSM speed and the required speed. For 
10 example, p=0.8 for a PSM with 8-MHz speed in 10-Gb/s systems using standard RS- 
FEC. The outage prevention performance with p=0.8 is shown with a dashed line in FIG. 
8. While insufficient PSM speed degrades the performance, OP can still be significantly 
reduced from 10" 3 to <10" 12 with -10 nodes installing PSM. As can be understood from 
the above results, the present invention provides for the effective elimination of PMD- 
15 induced system outages. 

Improvement of PMD Tolerance 

The dependence of the OSNR penalty on PMD is important in evaluating the 
system tolerance to PMD. FIGs. 9 A-B show the relative required OSNR (as compared 
to that without FEC and without PMD) for achieving a BER of 10" 15 as a function of 
20 mean PMD in non-return-to-zero (NRZ) on-off-keyed (OOK) and differential phase-shift- 
keying (DPSK) transmission systems, respectively. When no FEC is used, the decision 
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threshold and phase are optimized on a bit-by-bit basis or optimized for each case of 
instant PMD, assuming the PMD is slow varying and the receiver can track the change. 
An OSNR penalty of 2 dB occurs as the mean system DGD reaches about 17% of the bit 
period (T). When RS-FEC is used, the decision threshold and phase are optimized on a 
5 frame-by-frame basis for each mean PMD. FEC provides about a 6.5 dB improvement 
over the OSNR requirement. With the increase of PMD, there is substantial difference in 
PMD tolerance between the cases without and with PSMs. The PMD tolerance (at 2-dB 
penalty) of the NRZ-OOK system with FEC and PSMs is about 0.24T, about 70% larger 
than that with FEC but without PSMs. When RS-FEC is used in the NRZ-DPSK system, 

10 the PMD tolerances are about 0.1 7T and about 0.3 IT without and with distributed PSMs, 
respectively. The use of distributed PSMs almost doubles the PMD tolerance in systems 
with FEC. It is noted that such performance improvement cannot be achieved by simply 
putting a PSM at the transmitter, which cannot avoid "bad" PMDs. Also the PMD 
tolerance with FEC but without PSMs is smaller than without FEC. This is because of 

15 the "nonlinear" dependence of the corrected BER (by FEC) on the uncorrected BER, 
which normally results in a much larger increase in the corrected BER when the 
uncorrected BER, is only slightly increased (due to PMD), as shown in FIG. 10. It is thus 
very beneficial to take the advantage offered by PSMs in systems in which FEC is 
implemented. 

20 PMD tolerance can be further increased when more powerful FEC codes (i.e. 

those having a higher uncorrected BER threshold than RS-FEC for a given corrected 
BER) are used with the present invention, providing the criteria for sufficient PMD 
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scrambling is met. It can be understood by those skilled in the art that the present 
invention is applicable to systems and transmission methods employing various FEC 
codes including but not limited to Reed-Solomon codes, concatenated block codes, 
convolutional codes and codes with various interleaved depth. 
5 In addition, the present invention is also applicable to systems employing NRZ or 

return-to-zero (RZ) signal formatting, and/or OOK, DPSK, differential quadrature-phase- 
shift-keying (DQPSK) modulation formatting, or the like. 

Additionally, the tolerance to PDL and PDG can be significantly improved with 
the use of PSMs in systems with FEC. As discussed above with regard to PMD 

10 mitigation, the present invention is effective in substantially reducing the PDL and PDG 
induced outages by quickly redistributing the PDL and PDG to allow FEC to correct 
transmission errors, substantially reducing outage probability. 

Preferably the PSM speed (i.e. the frequency of the sinusoidal drive signals) is 
above about BR/(FEC-BECL x N) (e.g., 2 MHz for a 10 Gb/s system or 8 MHz for a 40 

15 Gb/s system with, 5 nodes with one or more PSMs and ITU G.709 recommended RS- 
FEC). 

We note that polarization scrambler modules according to the invention also 
scramble the phases of the signal bits, and polarization scrambling with very high speed 
(comparable to the data rate, BR) may cause large signal spectrum broadening (e.g., about 
20 two times the spectrum of the transmitted signal) and penalty. Preferably the PSM speed 
is below about BR / N (i.e. 1 GHz for a 10 Gb/s system or 4 GHz for 40 Gb/s system with 
10 nodes). 
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To take advantage of the FEC correction capability, it is desirable to limit the 
scrambling speed to below about BR / (8 x ID) 5 where ID is the interleaving depth of the 
FEC used in the system (note: 8 is the ratio between a byte (on which FEC processes 
data) and bit). For example, ITU Standard G.709 (ITU-T Recommendation 
5 G.709/Y.1331, "Interfaces for the optical transport network," February, 2001) 
recommends ID = 16. So for a 40 Gb/s system, the driver frequency is preferably less 
than about 300 MHz. 

In summary the preferred driver frequency is between about BR/(FEC-BECL x N) 
and the lesser of about BR / (8 x ID) and BR / N. 
10 Additionally, it can be appreciated by those skilled in the art that one advantage of 

the present invention over PMDC is that the present invention does not require 
polarization monitoring and feedback control, and can operate in a set-and-forget mode. 

Although the invention has been described with reference to illustrative 
embodiments, this description should not be construed in a limiting sense. Various 
15 modifications of the described embodiments, as well as other embodiments of the 
invention, which are apparent to persons skilled in the art to which the invention pertains, 
are deemed to lie within the principle and scope of the invention as expressed in the 
following claims. 



